# Dynamic Visual Tokens

Ristretto is a vision-language model that uses dynamic image token allocation: the number of image tokens can be adjusted to suit the task's requirements. It is described as surpassing previous generations in both performance and versatility.

Tags: Transformers, Supports Multiple Languages

Chat UniVi 7B V1.5

Chat-UniVi is a large language model with a unified visual representation, capable of understanding both images and videos.

© 2025 AIbase